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Abstract 

We extend first-order logic to include variadic function symbols, and 
prove a substitution lemma. Two applications are given: one to bounded 
quantifier elimination and one to the definability of certain Borel sets. 

1 Introduction 

A variadic function is a function which takes a variable number of arguments: 
for example, a function from N <N to N is variadic, where N <N denotes the 
set of finite sequences of naturals. In classical first-order logic, a language has 
function symbols of fixed arities. In this paper I will explore how variadic 
function symbols can be added to first-order logic. In so doing, we will also 
formalize the syntax of the ellipsis, • • • , which of course is closely related to 
variadic function symbols. 

To get an idea of the subtleties of the ellipsis, consider the following "proof" 
that 5050 = 385: 

1. We know 1 + • ■ • + 100 = 5050. 

2. We know 1 = l 2 and 100 = 10 2 . 

3. By replacement, l 2 H h 10 2 = 5050. 

4. Also, l 2 + • • • + 10 2 = (10) (10 + 1)(2 • 10 + l)/6 = 385. So 5050 = 385. 

Evidently, mathematicians implicitly impose some special syntax on the el- 
lipsis. This will be made explicit in the paper. 

Of course, we can already talk about unary functions N — > N which interpret 
their input as the code for a finite sequence. My hope is that some coding can 
be avoided by allowing variadic function symbols. 

I was led to investigate the syntax of variadic function symbols when I was 
investigating a certain class of subsets of Baire space and realized that I could 
characterize that class with the help of first-order logic extended by variadic 
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function symbols. The results are written up in Q]. Some of the basic results 
of this paper were first published there. 

Variadic functions are used in many programming languages. What little 
literature presently exists mostly seems to be in this context (for example, Byrd 
and Friedman and in the related context of A-calculus (for example Goldberg 

ED- 



2 Basic definitions 

Definition 1. 

• A first-order variadic language (or simply a variadic language) is a first- 
order language, including a constant symbol n for every n £ N, together 
with a set of variadic function symbols, and a special symbol • • - x for every 
variable x. 

• A structure (or a model) for a variadic language Jz? is a structure ^# for 
the first-order part of with universe N, and which interprets each n 
as n, together with a set of variadic functions N <N — > N, one for every 
variadic function symbol of ££ . If G is a variadic function symbol of Jz? , 

will denote the corresponding function G^ : N <N — > N. 

Definition 2. 

• (Terms) If _Sf is a variadic language then the terms of Jf, and their free 
variables, are defined as follows: 

1. If c is a constant symbol, then c is a term with FV(c) = 0. 

2. If x is a variable, then a; is a term with FV(x) — {x}. 

3. If / is an n-ary or variadic function symbol and u\, . . . ,u n are terms, 
then f(u±, . . . , u n ) is a term with free variables FV(ui)U- ■ -UFV(u n ). 

4. If G is a variadic function symbol, u, v are terms, and x is a variable, 
then 

G(u(0) ■ ■ - x u(v)) 
is a term, with free variables (FV(u)\{x}) U FV(v). 

• (Term substitution) If r,t are terms and a; is a variable, then the term 
r(x\t) obtained by substituting t for x in r is defined by induction in the 
usual way, with two new cases: 

1. If r is G(u(0) ■ ■ - x u(v)) then r(x\t) is 

G(u(0)--- x u(v(x\t))). 

2. If r is G(u(0) ■ ■ - y u(v)) where y ^ x, then r(x\t) is 

G(u(x\t)(0)--- y u(x\t)(v(x\t))). 
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Lemma 1. (Unique Readability) Assume Jz? has the following properties: 

1. Every symbol of Jz? is exactly one of the following: a left parenthesis, a 
right parenthesis, a logical connective, =, a constant symbol, a variable, 
an rt-ary predicate symbol for some n, an n-ary function symbol for some 
n, a variadic function symbol, or an ellipsis • • - x for some variable x. 

2. If some symbol is an n-ary function (resp. predicate) symbol and also an 
m-ary function (resp. predicate) symbol, then n = m. 

3. If some symbol is • • - x and ■ ■ - y , then x and y are the same variable. 
Then the terms of ^£ have the unique readability property. 

Proof. By the usual inductive argument. □ 

Henceforth, we will always assume every language satisfies the hypotheses 
of Lemma [T] Thus the free variables of a term are well-defined, as is term 
substitution. 

Example 2. (Finite Sigma Notation) If Jz? is a first-order language, we can 
extend it to a variadic language Jz?s by adding a variadic function symbol £ 
(along with ellipses and numerals). In practice, the term £(m(0) • • - x u(v)) is 
often written: 

V 

The obvious interpretation £ of £ in a structure is the variadic addition 
function £^(ao, . . . , a„) = ao + ■ ■ ■ + a n . 

Throughout the paper, if ^ is a structure, then an assignment shall mean 
a function which maps variables to elements of the universe of If s is an 
assignment and n £ N, I will write s(x\n) for the assignment which agrees with 
s everywhere except that it maps x to n. 

We would like to define the interpretation of a term in a model by an as- 
signment. This is straightforward in classic logic but when variadic terms are 
introduced, interpretation becomes more subtle. There are actually two possible 
definitions; they are equivalent, but to show it, we will first need to establish a 
substitution lemma for one of the two. 

When naively defining a numerical value for X)i=o M (*)' wnere u {i) an d v 
are mathematical expressions, we implicitly use a definition by recursion on 
expression complexity, as each summand u{i) may itself involve nested summa- 
tions. The process terminates because each summand u(i) is strictly simpler 
than y^.Q u(i), which is true because i itself is not a compound expression but 
a natural number. Now, we'd like to say 5Z i=0 u(i) = u(0) + - ■ - + u(v), where the 
summands on the right are recursively computed using the definition currently 
being made. But there are two ways to get here formally: 
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1. (Syntactic) Write a list of v + 1 terms, the nth of which is obtained by 
syntactically replacing the variable i in u by the constant n. Recursively 
compute and add each of these new terms, using the same values for 
variables as we're currently using. 

2. (Semantic) Write a list of v + 1 terms, each of which is exactly u. Re- 
cursively compute and add them, but when computing the nth one, do it 
assuming the value of variable i is n. 

This motivates the following definition of two interpretations (in Corollary [TU] 
we will see that both interpretations are equivalent). 

Definition 3. (Term interpretation) Let ^# be a structure for a variadic lan- 
guage Jzf, and let s be an assignment. Assume we've defined natural number 
interpretations u s , u s i (respectively, syntactic and semantic interpretations of 
u) for every assignment s' and every term u strictly simpler than t. We define 
t s and t s inductively according to the following cases: 



1. If t is a constant symbol c, then t" = t s — c . 

2. If t is a variable x, then t s = t s = s(x). 

3. If t is f(m, . . . ,u k ), then t s = f M (u\ % . . . ,u s k ) and t s = f^(u ls , . . .,u ks ). 

4. If t is G(u(0) ■ ■ - x u(vj), then 



Here v s denotes the constant symbol corresponding to the natural v s . 

Example 3. To illustrate the definition, assume v s — v s — 5. Then by defini- 
tion 



G(u(0) ■ ■ - x u(v)) s = G^(u(x\0) s , u(x\5) s ) 

= G^(u(x\0y,u(x\l) s ,u{x\2) s ,u(x\3) s ,u(x\4) s ,u{x\5) s ). 
G(u(0) ■ ■ - x u(v)) s = G-^(u s ( x \ ), u a ( x \ 5 )) 



Remark 4. Definition [3] may seem somewhat suspect because of how it uses 
meta-ellipses to define the semantics of ellipses. If we were forbidden from using 
meta-ellipses to define the semantics of ellipses, there are two approaches we 
could take. One approach would be to use simultaneous induction to simultane- 
ously define interpretations t s and t s and also define sequences (t(x\0) s , . . . , t(x\n) s ) 
and (t a (a;|o)j • ■ • i t s (x\n))- The latter would be defined by induction on n by means 
of concatenation: 



G(u(0) ■ ■ - x u(v)) s 
G{u(0)--- x u(v)) s 




(U s (x\0), U s (x\l), U s (x\2), W s ( x |3), U s ( x | 4 ), U s ( x |5)). 




4 



assuming that (t') s and (i')s' are already denned for every assignment s' and 
every term t' at most as complex as t; meanwhile, t s and t s would be defined 
as in Definition |31 except that we'd let 

G(u(0) ■ ■ ' x u(v)) s = G^({u(x\0) s , . . . , u(x\lFy)), 
G(u(0) • • - x u(v)) s = G M ((u s ( x \o), u s(xlVs} )). 

This approach does not truly use meta-ellipses except as a name; the name 
could be changed without changing the definition. Another alternative approach 
would be to use generalized structures which we'll discuss in Section [6] 

Remark 5. The syntactic part of Definition [3] relics on the fact that u(x\c) 
is strictly simpler than G(u(0) ■ ■ - x u(v)) for any constant symbol c. The min- 
imalist might wonder whether we can treat first-order variadic function sym- 
bols without so many constant symbols, using only the constant symbol 
and the successor function symbol S. Maybe the natural way to translate the 
definition of G(u(0) ■ ■ - x u(v)) s would be to take Definition [3] using numerals 
n = SS . . . S(0). But then u(x\c) would no longer necessarily be simpler than 
G(u(0) ■ ■ - x u(v)), casting doubt on the productiveness of the definition. One 
way around this dilemma would be to define term complexity not as a natural 
number but as an ordinal in e j defining the complexity of G(u(0) ■ ■ - x u(v)) 
to be, say, uj c ( u ) +c ( v \ where c{u) and c{v) are the complexities of u and v. Of 
course, such a radical approach is not necessary, but it is more elegant than other 
solutions to the dilemma, and this author considers it a nice and unexpected 
application of ordinals to syntax. 



3 The Substitution Lemma 

We will deal mainly with syntactic interpretations t 3 . We will obtain a substitu- 
tion lemma for these, and use it to show that the two interpretations are identi- 
cal. The choice is arbitrary: one could also obtain a substitution lemma about 
semantic interpretations and use that to show equality. Once either version of 
the substitution lemma is obtained, and the two interpretations are shown equal, 
the other substitution lemma becomes trivial. In any case, technical lemmas 
are required. 

Lemma 6. Suppose u,t are terms and x,y are variables. 

1. If x is not a free variable of u, then u(x\t) = u. 

2. If x is not a free variable of u, and s is an assignment, then u s does not 
depend on s(x). 

3. If x is not a free variable of u or t, then x is not a free variable of u(y\t). 
Proof. A straightforward induction. □ 
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In first-order logic, substitutability is a property of formulas, but it is not 
needed for terms: if r, t are any terms and x is any variable, then t is sub- 
stitutable for x in r (in first-order logic). This breaks down in variadic logic, 
requiring a notion of substitutability into terms. 

Definition 4. Fix a term t and a variable x. The substitutability of t for x in 
a term r is defined inductively: 

• If r is a variable or constant symbol, then t is substitutable for x in r. 

• If r is f(u\, . . . , u n ) where U\, . . . ,u n are terms and / is an n-ary or vari- 
adic function symbol, then t is substitutable for x in r if and only if t is 
substitutable for x in all the itj. 

• If r is G(u(0) • • •yu(u)) where G is a variadic function symbol, u,v are 
terms, and y is a variable (which may or may not be x), then t is substi- 
tutable for x in r if at least one of the following hold: 

— x is not a free variable of r, or 

— y = x and t is substitutable for x in v, or 

— y is not a free variable of t and t is substitutable for x in both u and 
u. 

For a non-substitutability example, consider the term X) J=o x ' V ano - try 
substituting t = y for x. The result is J2y=o V ' f ' which is n0 good since the 
new occurrence of y becomes bound by the summation. 

We define substitutability for formulas in the usual way, with just one change: 
if p is a predicate symbol (or =), and u\, . . . ,u n are terms, we say t is substi- 
tutable for x in p(ui, . . . , u n ) if and only if t is substitutable for x in each it,-. 

Lemma 7. Suppose t is a term which is substitutable for the variable x in 
G(u(0) ■ ■ - y u(v)). Then t is substitutable for x in u. And if x ^ y, then t is 
substitutable for x in u. 

Proof. By induction. □ 

Lemma 8. Suppose r, < are terms, x, y are distinct variables, and c is a constant 
symbol. If t is substitutable for x in r, and y does not occur free in t, then 
r(x\t)(y\c) = r(y\c)(x\t). 

Proof. By induction on complexity of r. If r is a constant symbol or f(r\, . . . , r n ) 
for some function symbol / and terms r% , . . . , r n (wherein t is substitutable for 
x), the lemma is clear by induction. If r is a variable, the claim follows since 
y does not occur free in t. But suppose r is G(u(0) • • - z u{v)) for some variadic 
function symbol G, terms u,v, and variable z (which may be x, y, or neither). 
Since t is substitutable for x in r, at least one of the following holds: x is not 
free in r; or z = ac and t is substitutable for a; in v; or £ is substitutable for x 
in it and v. If x is not free in r, then the lemma follows from Lemma [6] But 
suppose x is free in r. Unravelling definitions: 



G 



Hz... 


Then r{x\t) equals... 


— X 


G(u(0) •••,«(«(*!<))) 


^x 


G{u{x\t){Q)--- z u{x\t){v{x\t))) 


If z... 


Then r(x \t){y\c) equals... 


— X 


G(u(y\c)(0)--- z u(y\c)(v(x\t)(y\c))) 


= V 


G(u(x\t)(0)--- z u(x\t)(v(x\t)(y\c))) 


& { x ,y} 


G(u(x\t)(y\c)(0) ■ ■ - z u(x\t)(y\c)(v(x\t)(y\c))) 



By Lemma [3 t is substitutable for x in v, so v(x\t)(y\c) — v(y\c){x\t) by 
induction. And if z ^ x, then Lemma [7] tells us t is substitutable for x in u as 
well, and so by induction u(x\t)(y\c) — u(y\c)(x\t). The lemma follows by using 
these facts to rewrite the last row of the table and compare with a similar table 
for r(y\c)(x\t). □ 

Theorem 9. (The Variadic Substitution Lemma for Terms) Let ^# be a struc- 
ture for a variadic language Jz? and let s be an assignment. If r and t are terms 
such that t is substitutable for x in r, then r(x\t) s — r s ^ 1 '. 

Proof. We induct on the complexity of r, and most cases are straightforward. If 
x is not free in r, the claim is trivial; assume x is free in r. The two important 
cases follow. 

We must show G(u(0) ■ ■ - y u{v))(x\t) s = G(u(0) ■ ■ - y u(v)) s( -^ t! "> when y is a 
different variable than x and t is substitutable for x in G(u(0) ■ ■ - y u(v))(x\t). 
Using induction: 

G(u(0) ■ ■ - y u(v))(x\t) s = G(u(x\t)(0) ■ ■ - y u(x\t)(v(x\t))) s 



u(x\t)(y\0) s ,...,u{x\t) (y\v(x\ty) ) 
u(y\0)(x\t) s , ...,u(y |«(x|t)*) {x\i) s ) (*) 
( u{y\Q) s{x ^ \ ...,u(y v'W) ) 



= G 
= G 

= G 



To reach line (*), we need the fact that u(x\t)(y\c) — u(y\c)(x\t) for any 
constant symbol c. If x is not free in r, then it's not free in u (since y ^ x), and 
so this follows from Lemma [6] Otherwise, since t is substitutable for x in r by 
Lemma we must have that y does not occur free in t, and so we can invoke 
Lemma [5] 

We must also show G(u(0) ■ ■ - x u(v)){x\t) s = G{u{0) ■ ■ - x u(v)) s ( x l* s ) when t 
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is substitutable for x in G(u(0) • • > x u(v)){x\t). Using induction 

G(u(0) • • - x u(v))(x\t) s = G(u(0) ■ ■ - x u(v(x\t))Y 
= G M (u(x\0) s ,...,u(x 

= G M (u(x\0) s ,...,u(. 



v s{x\t 

i(x\oy {x ^ s \...,u( 

G(u(0)--- x u(v)y^ ts \ 



v s(x\f) 



i(x|t") 



(**) 



In line (**), I am able to change "exponents" from s to s(x\t s ) because the 
terms in question do not depend on x. □ 

Corollary 10. For any term t and assignment s, t s = t s . 

Proof. By induction on t. All cases are immediate except the case when t is 
G(u(Q) ■ ■ - x u(v)). Note that constant symbols are always substitutable and 
write: 



G(u(0) ■ ■ - x u(v)) s = G (u(x\0) s , . . . , u(x\v s ) s ) 
= G M (u s{xl0 \... 7 u s ^ vS ^ 



G u. 



(x|0), 



s(x\v s )) 



G(u(0)--- x u{v)) a 



(By Definition) 

(By Theorem EJ 

(By Induction) 
(By Definition) 



□ 

First-order formulas over a variadic language are now defined in the obvious 
way. By Corollary [TU1 we can define ^ \= t = r[s] if and only if t s — r s , or 
equivalently t s = r s ; that is, we are saved from having to make an arbitrary 
choice. The remaining semantics are defined inductively in exactly the same 
way they are for first-order logic. If ^# is a structure, s an assignment, and (j) a 
formula, then ^# |= (f>[s] is defined in the usual way from the above atomic case, 
and ^# \= <p means that ^# |= (j)[s] for every assignment s. Term substitution 
in a formula is defined as usual. Substitutability of a term for a variable in a 
formula is defined as usual, except that in the atomic case, we say t is substi- 
tutable for x in r = q if and only if t is substitutable for x in r and q (in the 
sense of Definition HJ. 

Corollary 11. (The Variadic Substitution Lemma) If t is a term which is sub- 
stitutable for the variable x in the formula <f>, and s is an assignment and ^# a 
structure, then ^ |= 0(x|f)[s] if and only if ^# |= 0[s(a:|i s )]. 

Proof. The proof is identical to the proof of the first-order substitution lemma, 
except that Theorem [9] is invoked for the atomic case. □ 
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Example 12. Working in an appropriate language and structure, it can be 
shown that 



x=0 

showing that it is safe to use the same variable in different roles, so long as we 
use Definition [3] to be completely clear what the truth of the formula means. 

4 Bounded Quantifier Elimination 

In this section, we shall assume our languages have no predicate symbols. If a 
language has a binary function symbol < and a constant symbol 1, I will write 
u < v to abbreviate < (u, v) = 1. 

Definition 5. 

• If Jz? is a variadic language, the quantifier-free formulas of Jz? are defined 
inductively: <p is quantifier-free whenever cp is atomic; and if cp and ip are 
quantifier-free, then so are <j> A ip, <p V ip, <j> — > ip, cp -H- ip, and -■</>. 

• If Jz? contains a binary function symbol < and constant symbol 1, the 
unbounded- quantifier- free (or uqf) formulas of Jz? are defined inductively: 
cp is uqf whenever <p is atomic; and if <f> and tp are uqf and x, y are distinct 
variables, then cj) A ip, <j> V ip, <p — > ip, (p ip, ^<P, 3x (x < y A (p), and 
Va: (x < y — > <f) are also uqf. 

Proposition 13. (Bounded Quantifier Elimination) Suppose Jzf is a variadic 
language containing (possibly among other things) binary function symbols <, 
+ and 8, and a variadic function symbol G. Suppose ^# is an Jzf-model which 
interprets + as addition and interprets <, S, and G by 



For any uqf Jzf-formula (p, there is a quantifier-free Jzf-formula ip, with the same 
free variables as (p, such that ^ |= <p> -H- ip. 

Proof. I will show more strongly that for any uqf formula cp, there is a term 
t^, with exactly the free variables of (p, such that \= <p ^ (t^ = 1) and 
^# ^ -i</> O (<0 = 0). This is by induction on cp: 

• If (p is u = v, take t^ = S(u, v). 

• If (p is ip A p, take t^ = 8{t^ + t p , 2). 

• If cp is ~^ip, take t<f, = S(t^,0). 



X 
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• If (j) is 3a: (x < y A ip) where x ^ y, take t$ — G{t^(Q) ■ ■ - x t.^{y)). 

• All other cases for (j> are reduced to the above by basic logic (there is no 
predicate case by assumption). 

In all but the 3 case, it is routine to check ^ |= (j> ■<->■ (t$ = 1), ^ |= -i0 <-> (t^, — 
0). The 3 case goes as follows. Assume <p is 3x (x < yAi/j), y ^ x. By induction, 
jg |= ip o (tj, = 1) and 1 ^ h = 0). So ^£ \= -0 o = l)[.s] for 
every assignment s. Let s be an assignment. Then: 

^ \= <t> [s] iff 
^ |= 3x (x < y A VO [s] iff 
3n e N s.t. |= a; < y A V> [s(a;|ra)] iff 
3n < s(y) s.t. (= fy, = 1 [s(x|n)] iff 

3n < s(y) s.t. tf^ n) = 1 iff 

G^ (tf^\...,tf Hv)) ) -1 iff 

G^ (^(xlO)^,...,^^^) 5 ) =1 iff 

Jt \= ©(^(O)--..^^)) = l[a] 

In line (*) we use the fact s(x\n)(y) = s(y) since y =/= x. In line (**) we invoke 
the Variadic Substitution Lemma (noting that constant symbols are always 
substitutable for x). □ 

Corollary 14. Let Jz? be the language with constant symbols n for all n G N, 
binary function symbols +, ■, 5 and <, and a variadic function symbol G. Let 
^# be the model which interprets everything in the obvious way (interpreting S 
and G as above) . A set X C N is computably enumerable if and only if there is 
a quantifier- free ££- formula (f>, with free variables a subset of {x, y}, such that 
for all n G N, n 6 1 H |= 3y ^(x|n). 

Proof. Let ^f = -£?\{G} be the first-order part of if. Assume A C N is c.e. By 
computability theory, there is a uqf formula 0o of ,£?o with the desired properties. 
The corollary follows by Bounded Quantifier Elimination. The converse is clear 
by Church's Thesis. □ 



(*) 
(**) 



5 Defining Borel Sets 

I will further extend the concept of variadic function symbols, and apply the 
idea to show that a certain language can define any 53° or 11° subset of N N 
with a formula of complexity £„ or n„ (respectively) , in a rather nice way. My 
interest in using powerful language to define Borel sets is partially influenced 
by Vanden Boom [13], pp. 276-277. 

By an extended variadic language I mean a first-order language together 
with various n-ary-by-variadic function symbols (for various n > 0), as well as 
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constant symbols n for every n € N and ellipses • • - x . A structure for an extended 
variadic language is a structure ^ for the first-order part, with universe N and 
which interprets each n as n, together with a function G*^ : N™ x N <N — > N 
for every n-ary-by- variadic function symbol G. Terms, term substitution, term 
interpretation, and term substitutability are defined in ways very similar to our 
work in Section 2, and the Variadic Substitution Lemma is proved in almost an 
identical way. 

Definition 6. Let (} be the empty sequence. 

• By J^Bor I mean the extended variadic language with a special unary func- 
tion symbol f along with, for every n > and every i : N™ — > N <N \{(}}, 
an n-ary function symbol £ b and an n-ary- by- variadic function symbol t l . 

• For any / : N — > N, is the J^Bor structure which interprets f as / and 
which, for any n > and i : N™ — > N <N \{()}, interprets 



• If is an JzfB or -sentence and S C N N , say that 4> defines S if, for every 
/ : N -> N, / e S if and only if Jt s \= cj>. 

Theorem 15. Let n > and let S C N N . Then S is £° (rcsp. n°) if and only 
if S is defined by a S„ (resp. n„) sentence of S^Bor- 

Proof. Obvious if S — or S = N N , assume not. If / is a finite sequence of 
naturals, I'll write [/o] for the set of infinite extensions of fo- 

(=>) Assume S is If n is odd, we can write S = Ui l£ N • • • Ui n£ N »„] 
where the {/»!-..»„} are finite, nonempty sequences. If n is even, we can write 
S = U neN • • • n 4 „ eN [/ 4l ... 4 „] c . Let t : N" ->■ N <N \{(}} be the map which sends 
(ii,...,i„) to /i!...^. Let / : N — >• N. For any (ii,...,i„), / extends 
if and only if r t / (ii, . . . , i n , /(0), . . . , f(£ L ' (h, . . . , i n ))) = 1. So if n is odd, 
then / e 5 iff 

J(f \= 3xi • • • 3x„r t (xi, . . .,x n ,f(z)(0) ■ ■ - z t(z)(£ L (xi,. . .,£„))) = 1. 

And if n is even, then / £ S iff 

|= 3xi • • -Vx„T t (xi, ...,x n ,f(z)(0)-- ■ z f(z)(£ L (x 1} . . . ,x n ))) = 0. 

The 11° case is similar. 

(<=) By induction on n. For the base case, first use an induction argument 
on formula complexity to show that if is a quantifier-free sentence of J^Bor 
and |= 4> t nen there is some k so big that whenever g : N — > N extends 
(/(0), . . . , f(k)), then „# s |= 0. Thus a set defined by a quantifier-free formula 
is open, hence clopen since its complement is also defined by that formula's 
negation. The base case follows: for example, if S is defined by a sentence 3x 0, 
then (by Variadic Substitution) S = U ie ^{g : N -> N : Ji g \= <p(x\i)}, a set 
since each unionand is clopen. The induction case is straightforward. □ 



£ L f (ai, . . . , a n ) = the length of i{a\, . . . , a„), minus 1 



(oi, . . . , a n , bi, . . . , b m ) = { Qot 
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6 A Partial Mechanization 



We partially automated Sections [2] and [3] using the Coq proof assistant [2] . In 
Coq, it is easier to work with the semantic, rather than the syntactic, term 
interpretations of Definition [3] This is because semantic term interpretation is 
recursive in a direct structural way: to interpret a term, one needs only interpret 
direct subterms. This is in contrast with syntactic term interpretation, which is 
recursive in term depth. To syntactically interpret a term, one must interpret 
terms which are not direct subterms. This makes it much more tedious to 
automate proofs about syntactic interpretations, so our automation primarily 
deals with semantic interpretations. We do, however, automate Corollarv llOl in 
light of which, the distinction disappears. 

Very often when automating mathematics, it is actually easier to prove a 
stronger result. This is certainly the case here. By a generalized structure ^# 
for a variadic language Jzf we mean a structure for the first-order part of Jzf, 
together with a set of interpretations G^ : N N x N — » N for each variadic 
function symbol G of Jzf. This is a generalization in an obvious way: given 
a structure ^M' as in Section [2J there corresponds a generalized structure ^ 
which agrees with on the first-order part of Jzf and is otherwise defined by 

G-*(f,v) = G-*\f(Q),...,f(v)) 

whenever G is a variadic function symbol, s£N, and / 6 N N . The syntactic 
and semantic interpretations in ^# of a term G(u(0) ■ ■ •% u(v)) by an assignment 
s are, respectively, 

G(u(0) ■ ■ - x u(v)Y = G M {k^ u(x\k) s ,v s ), 

G(u(0) ■ ■ - x u(v)) s = G J( {k M s ( a |fe),U s ). 

All the results of Sections[5]and[3]generalize accordingly. It is easier to automate 
these stronger results because Coq has better built-in support for working with 
functions N — > N than for working with finite sequences. 

For even further simplicity, we also assume that all functions are either vari- 
adic or binary, we assume the special constant symbols c are the only constant 
symbols in the language, and we assume there are no predicate symbols. 

Syntactic term interpretation seems to lie on the border of what Coq can 
handle. Coq cannot automatically detect that the definition is total. We are 
able to convince Coq of its totality using an experimental feature of Coq called 
Program Fixpoint (Sozeau [12]). Chung-Kil Hur ([8], [9]) helped us tremen- 
dously with the details of getting Program Fixpoint to work. 

In performing this partial mechanization, we were influenced by R. O'Connor's 
mechanization of ordinary first-order logic |llj . 

7 Future Work 

There are several directions to take this study from here. For one thing, Sec- 
tions [2] and [3] could easily be extended to other types of logic. In order to inject 
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variadic terms into a logic, there are two basic requirements: first, that function 
terms make any sense at all in that logic; second, that the logic has a semantics 
which plays well with variadic function symbols, especially the ellipsis. Some 
potential logics where we could add variadic function symbols include second- 
order logic, more general multi-sorted logic, and nominal logic, just to name 
three. The question is not so much whether the machinery can be added to the 
logic, but rather, what interesting applications result? 

In the direction of multi-sorted logic, we could deal with semantics where 
one sort ranges over (say) K and another ranges over N, and thereby rigorously 
study variadic functions living in the real numbers (single-sorted first-order logic 
falls short here: how are we to interpret a term like Y^i=o 

One of the shortcomings of this first-order treatment is that we were not 
able to give what should be a basic example: the general Apply function from 
computer science. If G : N <N — > N is a variadic function and m, . . . , rife £ N then 
Apply(G, m, . . . , rife) is defined to be G{n\, . . . , rife). This could be formalized 
using our variadic machinery in various typed logics where it makes sense to 
have a function symbol whose "arity" is some cartesian product of types. 

Another direction we can go from here is to consider function symbols of 
infinite arity. The basic idea is that if G is an infinitary function symbol in a 
language and u is a term and x a variable, then G(u(0) ■ ■ - x ) is another term, 
whose intended interpretation by a model M and assignment s is 

g(u(o) • • - x y = G^(u( x \oy, u( x \iy, . . .), 

where G itself is interpreted as some infinitary G** : N N — > N. In fact, much of 
the work needed for this is already done in the Coq mechanization of Section [SJ 
The reason that this direction would be exciting is that the bounded-quantifier 
elimination of Section |4] could be strengthened to full quantifier-elimination. 

Finally, we are interested in embedding the hydra game of Kirby and Paris 
[TO] (a short and very readable introduction is given by Bauer [3]) into term 
interpretation. A binary operator + is (left and right) self-distributive if it 
satisfies a + (b + c) — (a + b) + (a + c) and (a + b) + c = (a + c) + (b + c) (self- 
distributive operators were studied by Frink [5] and more recently (left-sided 
only) by set theorists and knot theorists, as surveyed by Dehornoy [5]). For 
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such an operator (assuming also associativity), 



E *i + • • • + E ** 

ti=0 ik=0 

V\ Vk—l 

= E *! + •••+ E *fc-i+*fcfei°) + ■••+**(**!«*) 

=o »is-i=a 

E *!+••-+ E i -+- *fc(**io) 

jfe-l=0 y 



iii=0 it_i=0 



which bears a certain resemblance to the act of cutting a hydra's head and 
having many isomorphic copies of its subtree regrow. 
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